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AESTRACT 

This study tests the hypothesis that optimum testing 
style will result in more efficient test performance by 4-year-olds, 
thus diminishing differences between scores of lower class children 
who have and have not attended enrichment programs- A review of the 
literature discusses the effects of testing situation variables, 
language differences, dialects, ethnicity, motivational factors, and 
manipulation cf pretest conditions on the scores of low income 
children. In tie present study, scores on the Illinois Test of 
Psycholingaistic Ability (ITPA) were compared, since it was assumed 
that language scores were particularly likely to reflect the effects 
of optimum resting style. Data included the ITPA scores of (1) 
children participating in the Family Development Research Program 
(PDRP) , an enrichment program for infants to school age children from 
low income families; (2) carefully matched control children who did 
not participate in the FDRP program but who were tested under FDRP 
conditions; and (3) children who had served as controls for other 
enrichment programs, in which testing was not done under FDRP 
conditions. Children in the FDRP experimental and control groups were 
not tested until they were at ease in the testing situation, with all 
attempts made to optimize test conditions. Although FDRP control 
children scored below children in the FDRP enrichment program, they 
performed only slightly below national norms on three of the ITPA 
subtests used in the comparison, and considerably above national 
noras on the other four subtests. Control children from the other 
enrichment programs generally performed below national norms on all 
subtests for which cciparable data were available. Results and 
implications for testing and intervention programs are discussed. 
(SB) 
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EOUCAHON STATED OO NOT NECESSARILY REPRI- 

SE NT OFFICIAL NATIONAL INSTITUTE OF 
EDUCATION POSITION OR POLICY 

Kffects of Testing Style on Language Sc;ores of Four-Year -Old 

^4 1. 

^ Low-Income "Control" Children in Intervention Projects 

jr\ Alice S Honig, J.R. Lally, L. Webb, F. Armstrong, & N,S. Wright. 
X) 

^ The^^oblc^ 

sr\ The purpose of this paper is to raise questions about psychological testing 

— ^ i'-tyles. Two conditions can possibly have been created by present testing techniques. 
Jj 

The f.irGt condition is the possible artificial rt .triction of scores for low 
income control populations because of limited testing techniques. The second 
condition is that estimates of success of enrichment programs may be based on 
luoaninglGss differences found between a control group with artificially restricted 
, ... scores ana an experimental group with normal scores - thus propagating enrichment 
techniques that might be of little value. 

Test manuals often give sage advice about building rapport v/ith young chil- 
dren and tlie importance of such rapport in order to insure optimal test scores 
for any youngster. For example, Cattail (1960) has urged that the following test 
conditions be adhered to as closely as possible in the testing of young children. 
The child should not be sick, tired, slf^py or in an antagonistic or unhappy 
mood when tested. The confidence of young children cannot be gained by 
verbal explanations but must be built up through an easy confident manner. 

M Disapproval of a child's actions can invalidate the score on his test. 
Testers should always be alert to boredom and counteract it with praise, 
encouragement or quick surprise presentation of new toys. An introductory 
toy should be used that will insure success but still arouse interest 
(pp. 74-75) . 

However, there has not always been sufficient adherence to such precepts 
and the proble.T. of the effect of testing styles on scores has not received as 
much attcijtion ay. it warrants. Testing style is a particularly critical issue 
to conr,id:;r v;}i'.-?n one regards the large number of intervention projects which use 
tscoro.'i r.j:om eontrast or control group^^ in order to demons t:r-ito g«^ins of experimental 
ERJjCl- Paper presented at the biennial meetings of the Society for Research in Child ^ 
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^ yroups. OtU'.u the control group scores reported are well below national norms. 
Enrichment group children performing at the norms are then often considered as 
"superior" in functioning as a consequence of program input. But are these dif- 
ferences valid? Many researchers (Labov, 19 ; Daratz S Baratz, 19 /C} and 
Valentine, 19 ) have argued the cultural inappropriateness of some tests. But 
other issuer also need to be considered. Lally (1970, 1972) has reported that 
within the past few years, over one hundred researchers engaged in intervention 
programs with children from zero to three years formed a group which met three 
to four times a year with hopes that these meetings would help them better under- 
stand each other's work. It was discovered early in the dialogue that the stan- 
dardized test procedures recommendal on most nationally recognized tests were ^ 
not sufficient to ensure comparable administration from <,ite to site. Suggestions 
were made to remedy the situation. For example, a videotape of each test battery 
would be agreed upon by the group and then sent from site to site to ensure 
quality control. Unfortunately, this remedy and others, although sorely noeded, 
were not acted upon. The meaning of test scores at different sites is still dif- 
ficult to judge and specific comparisons of test results across sites are hard 
to make without additional information about testing styles and testing conditions. 
Testi ng Situation Variables and Children's Scores 

Inquiry into testing variables which may influence children's scores and 
thus obscure sought-for relationships between program input variables and child 
outcome measures have focused on variables aside from those obviously associated 
with adequate knowledge of test content or of testing procedures with young chil- 
dren. Indeed, inept abilities of testers to gain rapport with young children have 
long been found to influence the kind and validity of data obtained not only in 
testing situation-; but in interviews as well ('/arrow, 1960) . 

Languag e differences, dialects, and ethnicity . When the child's commonly 
used hone language or dialect differs from the standard language in which a test 
j.£- presented, then different degrees of handica]; may accrue in test scores for 



3 

the children. Ncdler & Sebera (1971) have demonstrated this by giving the PPVT 
IQ tost (Poabody Picture Vocabulary Test) both in Spanish and in English to three- 
year-old bicultural children. As much as a 28 IQ point different in scores was 
obtained in favor of the children's performance on the Spanish version in con- 
trast to the English form of the PPVT.. 

Dialect differences, on the other hand, have been found to have more variable 
effects on test scores. Quay (1970) had black male testers administer the Stanford- 
Binet IQ test either in standard English or in black dialect to f our-and-one-half- 
year-old Project Head Start children, under differential conditions of reinforce- 
ment — either tangible candy reward or praise. It was expected that dialect plus 
the candy reward should optimize test performance of the children. No differences 
were found for any of the conditions. That is, intelligence scores, which ranged 
from 95.4 to 97.1, were affected neither by the dialt^ct used by the examiner nor 
by the type of reinforcement presented. Quay suggests t: ■ t different dialects may 
pose more of a problem for speech production tests but that children who speak 
a dialect predominantly and also have much exposure to standard English models 
seemed to have no difficulty in comprehending either the dialect or standard 
English spoken by the testers. 

With respect to speech production research, Resnick, Weld, and Lally (1969) 
have found that the standardized testing situation markedly reduced the frequency 
of expressive language among disadvantaged black infants. The total number of 
toddler vocalizations and words spoken was much higher in the pre-testing situa- 
tion whsn mother and infant v;ere alone in the testing room with many toys than 
in the actual testing situation. 

Motivational factors and test scores . Zigler and his colleagues have examined 
tJie cffoct:^ of motivdtional variables on test scores in a series of papers designod 
to show th.? \.njlnerabiij ty ol cognitive perforn:- ncei to such factors (Butterfield s* 
Zjaler, 1970; Zigler & Butterf j^eld , 1968; Zigler, Hogden, & Stevenson, 1958). Sow?. 
of this work (Zigler, Butterfield, & Capablanco, 1070) focused on the detrimental 
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on:cH:t.i; oi'. ordinary tost situations on retarded children starved for positive 
social contacts. "It appears that the retarded child's atypically high need 
for social reinforcement competes in the testing situation with the production 
of correct responses and, thus, results in intelligence test performance lower 
than that dictated by the child's intellectual resources. To the extent that 
tho. child's need for social reinforcement is ameliorated, this leads one to 
expect an increase in IQ" (p. 261) . 

Zigler and his colleagues (1973) later investigated and demarcated motiva- 
tional factors which led to the less adequate performance of economically dis- 
advantaged children on the PPVT.. Their motivational hypotliesis contained the 
assumption that children from poor families are much more wary of a tester than 
are nondisadvantaged children. Their prediction that a sample of . -four-and- 
one-half-ycar-'Old children enroUedin Head Start Centers would increase their 
Peabody IQ scores far more on re test within a week or two than would a contrast 
group of middle and upper-middle-income children was confirmed. Thus^ increased 
comfort deriving from familiarization with tester seemed critical to the signi- 
ficant positive IQ test-retest score gains of the disadvantaged children in 
comparison to their contrast group where no such gains accrued from first to 
second test. Later this study was repeated with additional conditions, one of 
which was a play session added prior to initial testing in order to allay situa- 
tional anxiety. The authors found that when 

^situational wariness is overcome, as by a pleasant play period, the initial 
IQ score becomes significantly higher than when no effort is made to dis- 
sipate the child's general anxiety. The finding that disadvantcTged children' 
performance, as compared with that of nondisadvantaged children, was more 
iritiaonced by the nature of their interaction with the E is consistent with 
earlier findings that socially deprived children are more sensitive to 
variation in the emotional climate created by adults tb-in are nondeprived 
children" (p. 301) , 5 



Connors Eisenborg (1966) have nlso reportocl that Pcabody IQ gains achieved 
by children In <i sMiniiUT Ho/id Start program reflected motivational characteristics 
of children rather than superior cognitive functioning. Sacks (1952) has reported 
that Stanford-Binet intelligence scores vari- .( from test to Lest as a function of 

the experimentally established social rclatior^ ^hip between child and examiner. 

Manipulation of prctoot conditions . Sensitivity to the importance of chil- 
dren's comfort as a factor influencing test scores has led some researchers in 
recent years to create conditions which will increase the familiarity and confi- 
dence with which children enter the testing situation. Garber (1975) has reported 
for the Milwaukee Project (which provided intensive enrichment e'<cperience for lov;- 
income black infants whose mothers' mean IQ scores was below 75) that every effort 
was made to ensure that infants who were earlier randomly assigned to the control 
group were comfortable and at ease prior to being tested. 

Palmer (1970) was more specific about his methods. He involved black male 
toddlers in either a carefully sequenced concept training curriculi^ or in a 
•discovery' curriculum to teach concepts such as "hot-cold.," He has clearly 
expressed the philosophy of establishing optimal test conditions, a philosophy 
which indeed may require much staff time and effort. 

No child began assessment until he had met the following criteria for 
adaptation to the Center's environs: (a) mother out of the testing 
c^jbicle, door closed, and child playing with examiner for at least 
20 minutes; (b) examiner judged the child was ready for testing or, 
if uncertain, requested a senior examiner to confirm or disconfirm 
her opinion. 

Most children were sufficiently adapted to the Center by their 
third visit to meet the ai^ove criteria, but some required as many as 
five visits before testing was begun. If during the assessment the 
child at any time ap^ ared to '"o dist ractecl beyond the efforts of 
the oxainincr to hold his attention, testing was discontinued until 
th(» foJ lowinci /init. Chi.lc'ren progres.sed through the measures at their 
' own x^cice and required from 4 to 15 hours to compleLj the battery as 0 
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a function of the characteristics of the individual child and the 
age of testing, (p. 3) 

Palmer attributes the scarcity of any socio-economic differences on 
cognitive test scores and measure to be found among his groups (at 3 years, 
8 monthp of age) to his concern for and attention to optimal testing con- 
ditions for all children. He concluded that "It is possible that middle- 
class children are more adapted to conditions where such testing ordinarily 
occurs and consequently perform better in those situations, but that the 
lowe^r-class child when given the opportunity to adapt performs as well'* 
(p. 9). 

Sometimes the manipulation of pretest conditions to maximize children's 
comfort unexpectedly points up the vulnerability of test scores to such manip- 
ulations, Costello & Martin (1969) carried out a language enrichment pre- 
school program for black f our-and-one-half year olds in a low-income housing 
project in Chicago over a period of several years. The first two •'waves" of 
children received the Stanf ord-Binet IQ test shortly after admission to the 
program. Wave III children in the third project year were not pretested until 
6 weeks after the program was under way. This initial adjustment period had 
a very favorable effect on scores achieved. For example, the mean IQ pretest 
score for Wave I children was 88,1; for Wave III children the niean pretest 
score was 9^-7 • Wave III children had been carefully chosen to represent an 
even more disadvantaged group of children than those in Waves I and II, Yet, 
this difference in scores favored Wave III children who were given an initial 
adjustment period prior to administration of the pretest. Costello & Martin 

(1969) observed that: 

Although we had selected a larger number of children of low competence 

level than occurred in the unbiased samplings of Wave?; I and II, our 
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observation as well as research evidence suggested that the positive 
Irfluences of testing procedures on the means would be greater than 
the negative influence of the increased proportion of low competence 
subjects. ... The more supportive test procedures of Wave HI shifted 
the score distribution most favorably for middle scoring children, less 
so for high scorers and least for low scorers, (p. 13) 
Such findings have been very important in increasing our awareness 
and concern with test conditions, particularly for initial conditions under 
which pretest expermental group data is gathered for a program and for the 
testing Situation and climate which exist for the "control group" child 
whenever he or she is tested. 
Hypotheses 

we have hypothesized in the present study that differenced, in testing 
styles which optimize conditions will result in more children performing 
more effectively and thus tend to decrease the probability of very low 
scores amorg lo„-income control group children. Thus the probability of 
finding large differences between enrichment groups and their controls 
should be minimized compared with the lo« scores typically reported for 
children from poor socio-economic backgrounds who have not participated 
in an enrichment program. 

we have further hypothesized that since language production has been 
found particularly vulnerable to comfort and familiarity factors, then an 
optimal testing style applied to a language test might more effectively 
reveal the positive influence of this optimal style on control children's 
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Method 

The testing philosophy in the Family Development Research Program 
(F'M<P) (Lally, 1974)--an enrichment program from early infancy to school 
age for low-income f amilies--has been firmly established as one which 
optimizes the testing conditions as much as is humanly and creatively 
possible both for children in the project and for their carefully matched 
controls from low-income families. No child is tested unless and until 
he or she appears very comfortable and is at ease in the testing situation. 
This may necessitate feeding children and scheduling several visits in 
order to begin and to complete an assessment battery. Seven subtests of 
dhe Illinois Test of Psycholinguistic Ability (ITPA) (Kirk, McCarthy, 6c 
Kirk, 1968) are adm^'.ni stered both to the children in the experimental pro- 
gram and to their longitudinal controls from age four onward. Thus, 
neither group has been exposed to the ITPA prior to this assessment. It 
should be made clear that scoring criteria for the ITPA have not been 
altered in any way, but on the contrary have been adhered to scrupulously. 
Table 1 provides a description of the ITPA subtests. 

Insert Table 1 about here 

Subjects 

In order to contrast scores of control group four-year-olds from the 
Family Development Research Program with scores of comparable low-income 
peers, have used ITPA data from a variety of program sources. Some of 
these programs used the 1961 version of the IT^\. Since the content validity, 
characteristic items, and standardization characteristics of subtests with 
earlier and revised versions are exceedingly close, the 1968 ^psycholinguistic 
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ago noriiLs iind jjcnlcd ticore norms derived from raw ecorea have been used, 
WhtMi project reports only provided one set of scores, the ITPA norm tables 
permitted a comparison to be made among the three types of scores for all 
projects rce^irdloss of the original form in which the data were presented. 
The data come from the Family Development Research Program (FDRP) ao 
well as ■: other projects with varying numbers of children tested at com- 
parable ages: (1) 16 children from the Durham Education Improvement Program 
(EIP) (Anastasiow, Stedman, & Spaulding, 1968), an educational interven- 
tion program designed for black and for white children living in poverty; 
(2) 101 children from the Karnes et al. (1969) language intervention pro- 
ject, in which low-income children, both black and white, were given sub- 
tests of the Illinois Test of Psycholinguistic Ability at age four just 
prior *:o participation in five different intervention programs'; (3) 18 
black children at f our-and-one-ha If years of age from the control group 
of the Milwaukee Project (Heber, Garber, Harrington, Hoffman, & Falender, 
1972); (4) 20 disadvantaged f our-and-ond-ha If black children in Sprigle's 
Learning-to-Learn Project (Van de Riet, 1972); (5) 15 disadvantaged four- 
and-one-half-year-old children in the Bereiter-Enge Imann patterned drill 
language i.,tervention program (1966) who received two ITPA subtests prior 
to admission into the program; (6) 132 children from the Horner Preschool 
Program for black f our-and- -ha If nar-old children from a low-income 
housing project in Chicago v JStcl j & Martin, 1969). 

Data from the children in the contrast programs have been compared 
with the ITPA subtest scores from the Family Development Research Program 
where optimal testing is the y-ule. Brief background information for some of the 
control children in the FDRP project provided in the Appendix shoula dispel 
anV tendency of readers to assume that the FDRP control children come from 
more advantaged backgrounds than the other groups. 

10 
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lawsuits 

Tlu» roiiu Us of thuiic* compar Lson j; nn* sihown In Table 2, ExiJminal lon 
of the ntnlod ITPA scores available for cbildrcn in the controjjt programa 

Insert Table 2 about here 

indicates that children in the control group of the FDRP project 

performed only slightly below national norm?; on three of the TTPA sub- 
tests used in the comparison, and considerably above the national norms 
cn the other four subtents. Children in the contrast groups performed 
for the* inoni pnrl bolow the nntlonal norms on all subtests for which 
comparablo data were avtiiiable. 

Children who have not undergone enrichment procfn'ures in the comparison 
groups are seen to have performed more poorly than VliD? contro? group four- 
year-olds. The Durham Program controls performed below national norms on 
six subtests and slightly above these norms on three subtests. Children 
from the Karnes programs at age four, prior to enrichment program entry, 
performed below national norms on five subtests and at the norm on one 
subtest. In the Milwaukee Project, controls performed below national norms 
on eight subtests, above on two. In the Sprigle program, controls performed 
uniformly below national norms on the four subtests given. In the Bereiter- 
Engelmann program, controls performed well below national norms on the three 
subtests given. In the Horner Preschool program, controls performed markedly 
below national norms on both subtests administered. 

Figure 1 graphically demonstrates the superior functioning of FDRP 
control children on ITPA subtests. The scaled scores of these children 
aro sct;n to li(^ near ur above the mean scaled scores of 35, which has been 
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not i\u tlH' St ,nHl.iril I iiatlDiun norm (or cMOh ii^.? {.'roup, TIa* bnr grflph 

1 n M i» r t Y 1 ^» u r t* 1 about I u r ^ ; 



iiulic.'HoH n ni;irkoii IcMultMuy lor control tof»t sroros from other projects 
to fall at or well he low thia Uiitionnl «calcd score mean of 35. 

Flt;urc 2 contraiits scalod scores of FDRP control ^roiip children with 
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scores achi.'ved by oxper iinont;* 1 children in all the comparison programs 
Including the FDRP pro^;rnm. Within the FDRP program, in each case, the scores 
controls fell below the scores of experimental four-year-olds. Yet if we 
examine FDRP control group scaled scores in comparison with the scaled 
scores achieved b> experimental children in the other comparison programs, 
we see that the FDRP control children perfoi^ed on many of the available 
subtest comparisons at a level superior to experimental children in com- 
parison groups. There arc seven subtests for which comparative data are 
available. In Figure 2 we note that on the Auditory Reception subtest 
and on the Granroatic Closure subtest, FDRP controls achieved a higher mean 
scaled score thnn experimentals in one of the two other programs with avail- 
able comparison data. On the Visual Reception subtest, FDRP controls 
achieved ^ higher mean scaled score than experimentals in two of the three 
other programs v;ith available comparison data. On the Auditory Reception 
seutest and on the Verbal Reception subtest, FDRP controls achieved a higher 
mean scaled score than experimentals in three of the other four programs 
with available comparison data. Only on one subtest, Visual Closure, did 
FDKP controls achieve a lower mean scaled score than that of the one other 
experiment:: 1 comparison group available, 
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Table 3 illustrates the consequences of superior control group 
functioning on different scores calculated between control and experimental 
groups for each program. 

Insert Table 3 about here 

We have subtracted the scaled scores achieved by control group children 
from the experimental scores of children in each project separately for 
each subtest. Data from only five of the comparison groups were used 
sincj experimental scores were only available in the other groups for experi- 
mental children well beyond the age of 4-0 to 5-0, the age range with which 
this study is concerned. It will be noted that in the Milwaukee project 
differences between experimental and control children were uniformly greater 
for all subtests than in any other project. By contrast, differences between 
experimental and control children in the FDRP project on each subtest were 
very modest, although uniformly in favor of the experimental group. Dif- 
ferences in the Durham project scores are difficult to interpret, since 
these scores do not represent experimental-control differences but represent 
pre- and post-intervention scores of the same children. The mixed direction 
of the calculated pre-post differences might lead one to infer that interven- 
tion had both detrimental and positive effects on the ITPA subscale perfor- 
mances of the children involved in the Purham project. 
Discussion pnd Conclusions 'J 

The above results indicated that control children from a low-education, 
low-incomG population, where vulnerability in language areas is considered 
particularly likely, performed at satisfactory levels on subtests of a 
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nationally standardized language assessment instrument, the Illinois Test 
of Psycholinguistic Ability. These outcomes were associated with a care- 
fully and consistently applied method of optimizing test conditions for 
childran in both the experimental and control groups of an intervention 
program. We are thus led to reconsider some of the assumptions and expec- 
tations of intervention projects and their testing programs. 

What are the implications of being below the national norms for the 
majority of control group children in intervention projects--chi Idren who 
are in many cases being tested in ways which we suspect are less than opti- 
mal? If we continue a) to use control groups such as those reported on 
here, and b) we have the children tested by conventionally trained testers, 
often giira<fp«l^ hftuliiifi^s with little training in sensitivity to children 
from different cultdral and economic groups, and c) we also expect to get 
control group scores belo\7 the national norms, and d) indeed we get such 
scores, then we ar-.i creating stereotypic sets for these groups. These 
stereotypic sets might be inaccurate, and there is the possibility that we 
ourselves by our own testing methods are helping to create those stereo- 
typic sets . 

Traditionally, intervention studies have always beer judged by compari- 
sons of experimental and control groups. Let us take the hypothetical case 
of an intervention program where the post -test mean score for the experimental 
group on an outcome measure is 90 and for the control group 70. Is that pro- 
gram of value if ^he control group mean score is "really" 90 also. That is, 
under optimal testing conditions both groups would have scored approximately 
the same. Perhaps we have to re-examine the upper yield of some of our pro- 
grams and intervention models. The maximal achievements of some models may 
in reality not be where we would want them to be. 

14 
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/ AnoL'her point is that we may need to look at new ways to evaluate 
the impact of intervention programs. If, it possible th^t the old control- 
versus-experimental group mocel is not the most appropriate way to look at 
the difference a program has made in the achievements and performance of 
the children who have been involved. What about regional differerjes? 
^Should we be impressed by significant E. vs. C group differences if control 
children in a given region tend to perform quite poorly on a test? Our 
experience in this study suggests that we should keep our primary focus on 
program goals rather than on significant group differences where control 
group scores are regionally very low. In such cases perhaps we should look 
again at the experimental group scores and relate them to expected outcomes 
based on national norms rather than simply compare experimental scores with 
control group scores which reflect "significantly poorer" performance. In 
this way a more realistic assessment of the effects of a given program can be 
made . 

Finally, we need to make recommendations for new styles of testing. 
We need a new level of quality. In addition, we need better techniques of 
quality control. Both of these are necessary to ensure uniformly sustained 
better quality of testing in our programs. One of the difficulties may lie 
in the fact that in addition to the procedures and directions given in test- 
ing manuals, there is a great deal of variance in the behaviors of testers — 
variance which is not addressed in the manuals. Personal styles can affect 
testing O'ltcomes. Another point is that the manual's definition of a "trial" 
has always focused on what the tester is to do, almost in exclusion of what 
the child is doing. The definition of a "trial" should be that the child 
actually tries and not that the examiner simply makes a presentation th;jt 
might not be ;5dequatcly received by tho child. 

15 
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Table 1 



Description of ITPA Subtests for Which Scores are Compared 



Auditory RccK^ption 
(Auditory Decoding) 



Visual Reception 
(Visual Decoding) 



Auditory-Vocal Association 



Visual-Motor Association 



Verbal Expression 
(Vocal Encoding) 



Manual Expression 
(Motor Encoding) 



Grainmatic Closure 



Visual Closure 



Auditory Sequential 
Memory 



Visual Sc'quential Memory 



This test assesses the ability of a child to understand 
verbally-presented increasingly difficult mat rial. The 
response required is a simple head shake or uoC or Yes-No 
answer. Sample question, "Do dials yawn?^' 

This test measures a child's ability to understand the 
meaning of a visual symbol. A stimulus picture X is on one 
page, and 4 response pictures are on another page. The 
child is to find the response picture conceptually similar 
to X. For example, X might be one kind of table. 

The organizing process of using language symbols meaning- 
fully is assessed in this sentence completion test which 
presents increasingly difficult analogy items, such as 
••Daddy is big; baby is 

This is a picture association t.;iSt. A single stimulus 
picture is surrounded by A response pictures., one of which 
is associated with the stimulus pictures. The child has 
to choose a response picture most closely related to the 
stimulus picture; such as a sock,, related to a stimulus 
picture of a shoe. 

This test assesses the ability of a chilJ to express as 
many of his own ideas and descriptions about a familiar 
object, such as a ball, as he can. He is asked, "Tell 
me all about this." 

This test asks for gestural manipulations by a child to 
as5;ess his understanding. H'* is required to pantomime 
actions appropriate to each pictured object, such as a 
telephone . 

The child's ability to use grammatic forms of Standard 
English is assessed. Syntax and grammatic inflections 
such as plurals and past tense forms are assessed in con- 
junction with pictured items. Example, "Here is a knife; 
here are two •" 

This test assesses a child's ability in 30 seconds to detect at 
identify common objects which are each partially hidden in 
a complex pictorial scene. 

This test is an assessment of the child's ability to repro- 
duce from memory sequences of digits increasing in length 
from two to eight digits. 

This test assesses whether the child can, with chips 
of figures, create increa^^ingly lengthy visual patterns 
identical in sequence to nonmeaningful figures presented 
for 5 seconds prior to the test. 



ERIC 



' Ttie subtest namo in parentheses refers to the name used In the 1961 version of thv. 
,fA. The other name refers to the same subtest in the 1968 version of the ITPA. 
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Table 2 



ITPA ScalQcl' Scores Achieved by Control and Experimental Children 
in Seven Intervention Projects 





Mean 




Aud. 


Vis. 


Aud. 


Group 


Age 


N 


Recep 


Recep 


Assoc 


Family Developajnt 












Research Program 










Exp. 


4-0 




36.4 


41.0 


37.5 


Controls 


4-0 






39.3 


33.7 


Durham EI? 












Exp. 


5-0 


16 


31.3 


38.5 


29.7 


Controls 












Group 1 


4-6 


7 








Group 3 


4-4.9 


9 








Total 


4-5.4 


it) 




Jo#iJ 


33.5 


Karnes' 












Programs 


Too 










Txp. 


Old 










Controls 












Trad. 


4-5.4 










^ora. 


4-1.3 


16 








Montes 


4-2.2 


13 








Araelior. 


4-4.6 


24 








Direct 


4-3,1 


23 








Total 


4-3.6 


101 


35.0 


34.5 


30.0 



Co^^t^ols 4-10 20 29.0 26.0 30.5 



Vis. 
Seq 



43.3 50.2 34.8 40.1 

40.4 47.3 34.2 38.2 



35.3 33. S 40.3 24.8 41.2 30.5 



34.0 30.5 37.5 28.0 39.0 34.0 



29.5 34.0 30.0 

Milwaukee 

Project 

(Heber) 

Exp. 4-7 18 36.8 46.1 40.7 41.1 42.3 43.9 42.4 43.5 40.6 39 9 

Controls A-7 18 28.8 37.8 29.4 31.8 34.4 38.2 28.7 31.8 33.1. 29.3 

learning to 
Learn 
(Sprigle) 

Exp- 5-0 22 33.3 29.5 38.0 



36.6 
31.0 
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Table 2 (cont.) 



C-^-. T ^ ^i^- verb. Man. Gra.. Vis M 

Age Recep Receo. Assoc. A«.n.. .... l^' ^l^' 



Bereiter- 
Engelaanr 
Progran 
Exp. Too' . 

Old 

Controls 4-4 15 



Horner 
Preschool 
Program 
(Costello) 

Exp. 4-11.7 i5o: 



Controls 4-5.2 ' 132*^ 



^ K for Auditory Association 
b 

N for Verbal Expression 

c 

K for Auditory Association 

d 

K for Verbal Expression 



Recep Recep. Assoc. Assoc. Exp. Exp. clos! Clo;. Zl ^ 



2^-0 30.0 25.0 



o/b 25.2 30.7 

94 



" 26.0 26.5 



s 



Table 3 

Differences Between Experimental and Control ITPA Scaled Scores 
of Children Within Each Program Group 



Aud Vis Aud Vis Verb Man Gram Vis Aud Vis 
Group Recp. Recp Assoc Assoc Exp Exp Clos Clos Seq Seq 



Fasily Development Research 

Program 2.3 1.7 3.8 2.9 2.9 .6 1.9 

Durban: Eip'' -1.7 2,5 -3.8 1.3 3.3 2.8 -3.Z 2.2 -3.5 



Milwaukee Project 

(Keber) 8.0 8.3 11.3 9.3 7.9 5.7 13.7 11.7 7.5 10.6 

Learning To Learn 

(Sprigle) 4.3 3.5 7.5 5.6 



Horner Preschool Program *.8 4.2 

(Ccstello) 



Ages of children in all groups range between 4-0 and 5-0.' 

In this analysis experimental children's pretest scores serve as control scores. 



REPRESENTATIONAL 



RECEPTION 



ASSOCIATION EXPRESSION 



VIS ^ 




AUTOMATIC 



CLOSURE 



SEQ. MEMORY 



2 45 



1 FAMILY DEVELOPMENT RESEARCH PROGRAM 
ERICHAM EDUCATION IMPROVEMENT PROJECT 



3 KARNES' PROGRAMS 

4 MILWAUKEE PROJECT 



5 LEARNING TO LEARN 

6 BEREITER-ENGELMANN PROGRAM 

7 HORNER PRESCHOOL PROGRAM 
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REPRESENTATIONAL 



RECEPTION 



ASSOCIATION 



EXPRESSION 



AUTOMATIC 



CLOSURE 



SEQJEMORY 




.1234 28 123« 1234J6 a 34 t234$6 12J4 1234 

o r DEVELOPMENT BESEARCH PROGRAM 5 LEARNINS TO LEARN .lihe, FDRP c.nlrol or 90 

EDUCATION IMPROVEMENT PROJECT S HORNER PRESCHOOL PROGRAM 21 



absent from the home and rarely works since he lacks skills. 30 



Appendix 

Sample Profiles of Control Group Mothers in the 
Family Development Research Program 

4274 is a pretty 25 year old mother of three children who is separated 
from her husband. She works at a downtown store as a waitress and receives 
some .support money from the childrens' father. She has attended ABC but has 
not received her high school diploma. She has high hopes for her children 
and would like to move from Syracuse. She would like to become a cosmo- 
tologist some day. Occasionally, she works as an expensive party prostitute. 
Her relationship with her exhusband is sometimes violent and the children are 
very aware of this. 

4205 is 19 years old and the mother of three children. She is single 
and is an inexpensive prostitute. She has abandoned her children and she 
comes home only when she is pregnant. The grandmother is now the chief care- 
giver since the state took the three children away from 4205. The extended 
family setting has seven children in all. 4205 has a drug problem in addition 
to frequent births and police troubles. She was enrolled at the Adult Basic 
Learning Center (ABC) trying to get a high school equivalency diploma. Family 
income continues to be mostly through social services and welfare benefits. 

4308 is married with three children. At 20 she is still unable, to cope 
with everyday living. She is very lethargic, unkempt, and looks and says she 
feels unhealthy. Her husband is unable to read and write well. The maternal 
grandmother over-sees the whole family unit. 4308 has had three miscarriages 
in the past 18 months and appears to be in very poor health. The grandmother 
is the chief caregiver with the children. 4308 lacks motivation even to dress 
herself in the morning. She has attended ABC program but as yet has no high 
school diploma. Her sole source of income is welfare. Her husband is often 



